A Semi-Automated Workflow Solution for Data Set Publication
نویسندگان
چکیده
To address the need for published data, considerable effort has gone into formalizing the process of data publication. From funding agencies to publishers, data publication has rapidly become a requirement. Digital Object Identifiers (DOI) and data citations have enhanced the integration and availability of data. The challenge facing data publishers now is to deal with the increased number of publishable data products and most importantly the difficulties of publishing diverse data products into an online archive. The Oak Ridge National Laboratory Distributed Active Archive Center (ORNL DAAC), a NASA-funded data center, faces these challenges as it deals with data products created by individual investigators. This paper summarizes the challenges of curating data and provides a summary of a workflow solution that ORNL DAAC researcher and technical staffs have created to deal with publication of the diverse data products. The workflow solution presented here is generic and can be applied to data from any scientific domain and data located at any data center.
منابع مشابه
Towards Semi-automated Workflow-Based Aggregation of Web Services
Abstract. Service aggregation is one of the main issues in the emerging area of service-oriented computing. The aim of this paper is to contribute to the long-term objective of lifting service aggregation from manual hand-crafting to a semi-automated engineered process. We present a methodology which, given a set of service contracts, tries to construct an aggregation of such services. Service ...
متن کاملDynamic configuration and collaborative scheduling in supply chains based on scalable multi-agent architecture
Due to diversified and frequently changing demands from customers, technological advances and global competition, manufacturers rely on collaboration with their business partners to share costs, risks and expertise. How to take advantage of advancement of technologies to effectively support operations and create competitive advantage is critical for manufacturers to survive. To respond to these...
متن کاملSemi-Supervised Never-Ending Learning in Rhetorical Relation Identification
Some languages do not have enough labeled data to obtain good discourse parsing, specially in the relation identification step, and the additional use of unlabeled data is a plausible solution. A workflow is presented that uses a semi-supervised learning approach. Instead of only a predefined additional set of unlabeled data, texts obtained from the web are continuously added. This obtains near...
متن کاملQuantitative Cross-linking/Mass Spectrometry Using Isotope-labeled Cross-linkers and MaxQuant*
The conceptually simple step from cross-linking/mass spectrometry (CLMS) to quantitative cross-linking/mass spectrometry (QCLMS) is compounded by technical challenges. Currently, quantitative proteomics software is tightly integrated with the protein identification workflow. This prevents automatically quantifying other m/z features in a targeted manner including those associated with cross-lin...
متن کاملConstruction of biological networks from unstructured information based on a semi-automated curation workflow
Capture and representation of scientific knowledge in a structured format are essential to improve the understanding of biological mechanisms involved in complex diseases. Biological knowledge and knowledge about standardized terminologies are difficult to capture from literature in a usable form. A semi-automated knowledge extraction workflow is presented that was developed to allow users to e...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- ISPRS Int. J. Geo-Information
دوره 5 شماره
صفحات -
تاریخ انتشار 2016